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Description 

BACKGROUND OF THE INVENTION: 

1. Field of the Invention. 

The invention relates to the field of filtering of video 
signals for a raster scanned display, particularly one em- 
ploying computer generated pixel data. 

2. Prior Art. 

Most cathode ray tube (CRT) computer video dis- 
plays are formed with a raster scan. Many of the stand- 
ards used with these displays can be traced to television 
standards. For example, two interlaced fields are fre- 
quently used to form a frame. Many early personal com- 
puters provided compatible NTSC signals to permit a 
user to use low cost television receivers. In other in- 
stances, computers generate signals such as overlays 
which are used in conjunction with NTSC signals. Thus, 
personal computers often generate pixel data for use on 
interlaced, raster-scanned displays. 

Computer generated data has some characteristics 
which make it less desirable for an interlaced, raster- 
scanned display than video signals originating in a video 
camera. For example, pixel data can exhibit changes (e. 
g., amplitude) over its entire range from pixel-to-pixel. 
That is, virtually any change in pixel data can occur from 
one pixel to the next. In contrast, video data from a tra- 
ditional video camera uses a beam spot which encom- 
passes more than a single pixel area. The data inter- 
preted for a single pixel in this case takes into account 
to some extent the intensity and color of the surrounding 
area. Therefore, there is a softening, even a blurring, 
that occurs as the beam scans the image in a camera. 

The human visual system is an edge-detection sys- 
tem. The eyes are very good at finding contours that de- 
lineate shapes. To give an example, when displaying a 
sequence of adjacent gray bars of increasing density on 
a computer display, the edges between the bars seem 
emphasized. Perceptually the gray bars do not look like 
solid colors, but rather they look like they have been 
shaded between their edges. In other words, the border 
between the gray bars appear enhanced by the edge- 
detection mechanisms of the eye. 

When a typical real world scene is displayed on an 
interlaced display, there are no abrupt transitions from 
one scan line to the next. Objects generally do not have 
very hard edges, and those that do usually do not have 
edges lined up with a scan line. The result is the eye 
cannot find an edge from one scan line to the next. If the 
eye cannot find an edge between one scan line and the 
next, it cannot distinguish between lines. In an interlaced 
display a complete frame is drawn each 1/30th of a sec- 
ond, however, because of the interlacing each 1 /60th of 
a second, either a given scan line or the next scan line 
is flashed. The eye perceives these multiple scan lines 



as thick single lines flashing at a 60 frame/second rate 
even though they are in fact flashing at 30 frames/sec- 
ond. By this model, close viewing of an interlaced dis- 
play should result in perception of flicker at 30 frames/ 
5 second. 

This is in fact what happens; if one is close enough 
to view individual scan lines on a NTSC television, in- 
terlace flicker (i.e., 30 frame/second flashing) is seen, 
even with a real world image. 
10 in the case of a computer generated image such as 
a MACINTOSH computer image on a interlace display, 
virtually every place where there is other than solid white 
or solid black there are abrupt transitions in the vertical 
dimension. 

15 (Macintosh is a registered trademark of Apple Compu- 
ter, Inc.) In the case of the "racing stripes" (alternately 
black and white horizontal lines) on the top of a typical 
Macintosh window, there is the most abrupt transition 
possible, black to white, stretched across the length of 

20 the window and repeated for several lines. Here, it is 
easy for the human eye to detect the edge from one scan 
line to the next, so it considers the scan lines as individ- 
uals, flashing at 30 frames/second. The visual percep- 
tion of the human observer is that where there are abrupt 

25 transitions on the display, the NTSC image flickers no- 
ticeably enough to be distracting. 

One additional subtlety is worth mentioning. The 
human eye will see flicker display wherever there are 
transitions (i.e. .edges) in the vertical dimension. But, the 

30 degree of flicker is not uniform for each type of graphic 
pattern. The worst pattern is the racing stripes across 
the top of a window, mentioned above. Text and other 
random patterns flicker as well, but not nearly as severe- 
ly. This is accounted for by the fact that it is easier to 

35 discern vertical edges where there is a high horizontal 
correlation to the pattern (as in the case of the racing 
stripes), but harder to find the edges when there is a low 
horizontal correlation (as in the case of text). 
As will be seen, since the present invention provides 

40 adaptive filtering for the subtlety.) 

Numerous prior art techniques are known including 
those employing anti-aliasing filters for removing this 
flicker. In some cases, fiiters duplicate the softening ef- 
fects of the camera beam, that is, pixel data for a cluster 

45 or spot of pixels is "averaged" or "convolved" to produce 
filtered pixel data. In general, these techniques require 
considerable computational overhead. 

US-A-4 215414 teaches the use of both vertical and 
horizontal filtering of pixel data to provide the blurring, 

so a desirable characteristic inherent in a TV camera, and 
in particular a two-line convolution for phosphorous 
tubes. Alternate embodiments are also described, in- 
cluding the filtering of a 3x3 array of pixels or of a 3x2 
array. 

55 However, this reference recommends to store an 
entire line in a delay line, which is costly and therefore 
disadvantageous. 

Further, to the extent that it does discuss vertical 
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filtering without delay lines, the reference provides the 
teaching of multiplexed reading of pixel data and then 
demultiplexing for presentation to an adder. 

However, there is no suggestion in the description 
of this reference on how to obtain a practical system for 
vertical filtering without delay lines. For instance, it is far 
from obvious to implement the multiplexing and the de- 
multiplexing with ordinary DRAMs, typically used in a 
frame buffer. This would require reading data from the 
memory at a rate too fast to enable providing a real time 
quality display. For example, if one assumes a dot clock 
rate of 12MHz and 3 accesses per pixel (for a pixel in 
three different lines), this requires an access time of ap- 
proximately 28 nanoseconds. This access time is not 
achievable with ordinary DRAMs. If one were to select 
VRAMs with their greater bandwith, then the multiplex- 
ing and demultiplexing would not work since one could 
not arbitrarily pick the needed infonnation from the bit 
streams. 

Therefore, US-A-4 215 414 simply does not provide 
any teaching for vertical filtering which can be practically 
implemented. 

The present invention seeks to provide a method 
for providing filtered pixel data (in the vertical direction 
only) by means of a convolution technique which is ca- 
pable of being performed "on the fly", by having compu- 
tational demands which are substantially less than that 
required by the prior art systems. 

SUMMARY OF THE INVENTION 

The present invention provides a method as defined 
in claim 1 . Preferred aspects of the invention are defined 
in the dependent claims. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is a general block diagram illustrating the 
general placement of the present invention in a video 
system. 

Figure 2 is a diagram used to illustrate a method 
used by the present invention to read data from a frame 
buffer. 

Figure 3 is a diagram used to illustrate an alternate 
method used by the present invention to read data from 
a frame buffer. 

Figure 4 is a block diagram illustrating an embodi- 
ment of a convolver used in the present invention 

Figure 5 is a block diagram illustrating another em- 
bodiment of a convolver used in the present invention. 

Figure 6 is a block diagram illustrating another 
method for obtaining convolved data particularly useful 
where not many bits are stored for each pixel. 

Figure 7A is a block diagram of a general prescaler 
which can be used with the convolver of the present in- 
vention. 

Figure 7B is a block diagram of another prescaler 
which can be used with the convolver of the present in- 



vention. 

Figure 8 illustrates a first step in implementing the 
present invention in software for a "chunky" frame buff- 
er. 

5 Figure 9 illustrates a second step in the implemen- 
tation described in conjunction with Figure 8. 

Figure 10 illustrates a third step in the implementa- 
tion described in conjunction with Figures 8 and 9. 

Figure 11 illustrates a fourth step in the implemen- 
io tation described in conjunction with Figures 8-10. 

Figure 12 illustrates a fifth step in the implementa- 
tion described in conjunction with Figures 8-11. 

Figure 13 illustrates gray values loaded into the 
color lookup table. 

15 

DETAILED DESCRIPTION OF THE PRESENT 
INVENTION 

A method for providing filtering data, in a raster- 
20 scanned video apparatus is described. The invention 
provides filtering in the vertical direction (perpendicular 
to the direction of the scan lines). In the following de- 
scription numerous specific details are set forth in order 
to provide a better understanding of the present inven- 
ts tion. It will be obvious, however, to one skilled in the art 
that the present invention may be practiced without 
these details. In other instances, well-known circuits and 
computer operations have been shown in block diagram 
form, in order not to obscure the present invention in 
so unnecessary detail. 

OVERVIEW OF THE PRESENT INVENTION 

Referring first to Figure 1 a frame buffer 10 is illus- 

35 trated which in the presently preferred embodiment may 
be an ordinary frame buffer, for example, one fabricated 
from dynamic random-access memories (DRAMS) or 
video random access memories (VRAMs). Most often, 
data is organized in the frame buffer by scan lines with 

40 data being stored for each pixel along each scan line. 
In some cases, the pixel data is organized in planes 
such that the pixel data for a given pixel is stored with 
each bit on a different one of the planes. When data is 
organized in this arrangement, a scanning address 

45 causes a bit from each plane to be read from the mem- 
ory, the bits are assembled to form a pixel, and hence, 
the data for a given pixel is read from the memory for a 
video display. (Often when writing data to a memory or- 
ganized by planes, an 8 or 16 bit word is written to each 

so plane; this is particularly useful for a black and white, or 
two-color display, where only a single bit is stored per 
pixel and hence data is written into only a single plane). 
For some embodiments of the present invention, the da- 
ta is stored in an ordinary manner as described above. 

55 An address generator 1 1 is used to address the data 
in the buffer to provide an output signal for a video dis- 
play. As will be seen with the present invention the order 
in which the data is scanned from the buffer is different 
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than that used in the prior art and hence, the address 
generator 11 provides this unique addressing order. 
(This is referred to as "kernel-scanned" in Figure 1 .) The 
specific order employed will become apparent, particu- 
larly from the discussion below for Figures 2 and 3. Or-; 
dinary circuits may be used to implement the generator 
11 and provide the order described in Figures 2 and 3. 
As in the case with prior art generators, the address gen- 
erator 11 is generally synchronized with a dot clock. 

The output from the buffer 10 is convolved by con- 
volver 12. The output of the convolver 12 is pixel data 
which can be used in an ordinary manner for a video 
display. The convolver 12 is described in conjunction 
with Figures 7A and 7B. 

In the currently preferred embodiment the output of 
the convolver 1 2 is gamma corrected. Such gamma cor- 
rection is well-known in the art and used to compensate 
for the non-linear light intensity curve of CRT displays. 
The digital information on line 14 is converted to an an- 
alog form for coupling to a display. 

In the following description it is assumed that the 
buffer 1 0 stores pixel data. It will be appreciated that the 
buffer may store pointers to another memory such as a 
color lookup table. In this event, the output of the buffer 
10 is coupled to a color lookup table and the output of 
the color lookup table is coupled to the convolver 12. 

In Figure 2 it is assumed that each of the blocks in 
the illustrated grid represents a pixel in a bit mapped 
buffer. In the horizontal direction the pixels are num- 
bered from 0 through 9; it will be appreciated that in a 
typical memory, many more pixels are used in the dis- 
play. In the vertical direction the rows of pixel data are 
numbered by scan line beginning with line 0, and ending 
at line 5. Again, it will be appreciated that in a typical 
display many more scan lines are used. Figure 2 thus 
represents the data organization that is found in a typical 
frame buffer. 

For the present invention the data for a given pixel 
(e.g., pixel 0) is read from the memory for several lines 
(e.g., lines 1, 2 and 3) before the pixel data for pixel 1 
is read from the memory. The pixel data for the several 
lines of a given pixel is convolved to provide the pixel 
data used by the display. 

More specifically, in Figure 2 the data at locations 
16, 17 and 18 is read from the memory before the data 
for pixel 1 9 is read from the memory. The pixel data from 
locations 16,17 and 1 8 is then convolved to provide pix- 
el data for pixel 0 of a display line. Next, the pixel data 
at locations 19, 20 and 21 is read from the memory and 
convolved to provide pixel data for pixel 1 of the display 
line. This process continues for each of the pixels 0 
through 9 for scan lines 1-3 to provide pixel data for a 
given display line. 

For the illustrated embodiment three lines of data 
are used in the convolution process. Any number of lines 
may in fact be used, for example, the data from lines n, 
n+1 , n+2 ... N+n may be first used to provide pixel data 
for a first display line. Following this, the data from lines 



n+1, n+2, n+3 ... n+N+1 is used to provide the pixel data 
for a second display line. However, the data is used from 
the buffer such that all the pixel data for, for instance, 
pixel M is read for all the scan lines being used in the 

5 convolution before the pixel data for pixel M+1 is read 
from the buffer. 

In some cases, the addressing and mapping 
scheme used for a frame buffer provides more than the 
data for a single pixel for each address. As illustrated in 

10 Figure 3 assume that a single address provides the pixel 
data for pixel 0 and pixel 1 of line 1 , this is shown by the 
enclosing line 23. With the present invention, the data 
associated with line 23 is first read from the memory, 
followed by the convolution of data associated with lines 

is 24 and 25. Then convolution is performed on the data 
for pixel 0 of lines 1 , 2 and 3 for pixel 0, followed by the 
data for pixel 1 , for lines 1,2,3. Now the data associated 
with lines 26, 27 and 28 is read from the memory, and 
so on. 

20 in a specially organized frame buffer such as that 
described in a presently preferred embodiment, a single 
address provides the data for several lines in the buffer. 
For example, a single address may provide the data as- 
sociated with lines 23, 24 and 25. In this event, the first 

25 data for pixel 0 is convolved, then that of pixel 1 is con- 
volved. After that the data associated with lines 26, 27 
and 28 is read from the memory and the data for pixel 
2, and then 3 is convolved. This process continues for 
ail the data along the line for scan lines 1 , 2 and 3. 

30 Thus, in general, the data for a first pixel for scan 
lines n, n+1, n+2 ... n+N is read from the buffer before 
the pixel data for subsequent pixels on these scan lines 
is read from the buffer. This data is then convolved to 
provide pixel data for a single pixel. This process is re- 

35 peated for each of the pixels along the scan lines n, n+1 , 
n+2 ... n+N. Following this the data for a first pixel along 
lines n+1, n+2... n+N+1 is read from the buffer, again 
before the pixel data for subsequent pixels along these 
lines. This data is then convolved to provide the filtered 

40 pixel data for the first pixel of the next display line. This 
process is repeated until vertical filtered data is provided 
for the entire display. 

EMBODIMENTS OF THE CONVOLVER 

45 

As mentioned above, the pixel data trom "N+1 " lines 
of data may be convolved. In the currently preferred em- 
bodiment N=2. (There is a discussion of convolution for 
other kernels later in this application.) In this case, N 
so implements the equation : 

aP 1 +bP 2 +aP 3 
2a+b 

55 

where P 1 is the pixel data for the first pixel of the nth 
scan line, P2 the pixel data for the first pixel of the nth+1 
line, and pixel 3 the pixel data for the n+2 scan line, "a" 
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and "b" are constants with "b" usually being greater than 
"a". In a typical application a=1 and b=2. 

In Figure 4, the convolver (corresponding to kernel 
convolver 12 in Figure 1 ) includes a prescaler 32 which 
receives the input pixel data trom the buffer. The amount 
of prescaling performed by prescaler 32 is controlled by 
the output of the coefficient table 33. The output of table 

33 is controlled by the current cycle number which will 
be discussed. The output of the prescaler 32 provides 
one input to an adder 34. The other input to the adder 
34, in effect is the output of the adder 34 after being cou- 
pled through the latch 35 and the multiplexer 31. The 
multiplexer 31 either provides as an input to the adder 

34 the output of the latch 35 or the value 0. As will be 
seen at "cycle 0", the 0 input is provided to the adder 
34, otherwise the contents of the latch 35 is the input to 
the adder 34. The contents of the latch 35 is normalized 
by a normalizer 36, the amount of normalization, typi- 
cally a constant, is shown as normalization value 37. 
The output of the normalizer 36 is latched by latch 38, 
and the contents of this latch provide the pixel data for 
a pixel along a display line. 

In practice, the prescaler is simply a digital shifter 
that provides digital multiplication by a factor of 1 or 2 
and the normalizer 36 is another digital shifter which per- 
forms division by shifting the digital data by, for example, 
2 places for division by four. 

Assume first that a=1 and b=2 in the equation dis- 
cussed above. Further assume that data is being 
scanned from a buffer in the manner illustrated and de- 
scribed in conjunction with Figure 2. The convolver can 
be seen to operate in a clock cycle sequence. During a 
cycle 0 the data associated with circle 16 is coupled to 
the prescaler 32. Cycle number 0 when applied to the 
coefficient table 33 causes the prescaler 32 to multiply 
this data by one, hence, the data is directly coupled to 
the adder 34. The cycle 0 coupled to the multiplexer 31 
selects the zero input to the adder; therefore, 0 is added 
to the data associated with circle 1 6. This data is simply 
latched within latch 35 under control of the pixel clock. 
Next the data associated with circle 1 7 is coupled to the 
prescaler 32 on cycle 1. The cycle 1 input to the table 

33 causes the prescaler to multiply this data by 2 (a left- 
shift of one) before coupling it to the adder 34. At the 
same time the output of the latch 35 is coupled through 
the multiplexer 31 and is added to the output of the pres- 
caler 32. Hence, the sum P1 +2P2 is formed and cou- 
pled to the latch 35. Following this on cycle 2 the data 
associated with circle 18 is coupled to the prescaler 32. 
The cycle number "2" coupled to table 33 causes this 
data to be directly coupled to the adder 34. The adder 

34 adds this data to the data contained within latch 35 
forming the sum P1 +2P2+P3. This sum is latched within 
latch 35 and then normalized by normalizer 36. For the 
described embodiment, normalizer 36 divides the data 
by a factor of 4 (a right-shift by 2) forming the final equa- 
tion fo^sif^.. The resultant pixel data is latched in latch 
38. On cycle 0 this data may be read from the latch 38 



while new data for the next pixel is being coupled to the 
prescaler 32. 

A fourth cycle may be used (i.e., cycle 3), in which 
event cycle 3 can control latch 38 with no data being 

s shifted into the prescaler 32 during cycle 3. This can be 
used if 3 cycle timing is inconvenient. 

An alternate convolver is illustrated in Figure 5. In 
this embodiment, an adder 40 receives as a first input, 
the output of the prescaler 43. Once again, the prescaler 

10 43 receives the pixel data from the buffer. The amount 
of prescaling of prescaler 43 is controlled by the coeffi- 
cient table 44. The output of table 44 is controlled by the 
cycle number coupled to the table. The other input ter- 
nunal of the adder 40 receives the output of the latch 

15 42. The input to the latch is the output of the multiplexer 

41. Multiplexer 41 selects either the output of the pres- 
caler 43 or the output of the adder 40. The multiplexer 
41 is controlled by the cycle 0 signal; for cycle 0 multi- 
plexer 41 selects the output of the prescaler 43, other- 

20 wise it selects the output of the adder. The output ot the 
latch 42 is coupled to a normalizer 46. The amount of 
normalization is controlled by the values shown as "nor- 
malization value 45". The output of the normalizer 45 is 
coupled to a latch 47. The output of the latch 47 provides 

25 the filtered pixel data. The circuit of Figure 5 performs 
the same convolution as the circuit of Figure 4. 

Assume that the data for line n for pixel 0 is coupled 
to the prescaler 43. During the cycle 0 the multiplexer 
41 selects the output of the prescaler 43 and couples 

30 the data into the latch 42. The prescaler 43 does not 
scale the data, because a=1 in the equation discussed 
above. The data for pixel 0 of line n+1 is prescaled by 2 
and this data is then added to the contents of the latch 
with the sum being coupled to the multiplexer 41 and 

35 latched in latch 42. The process continues until the sum 
aP1 +2P 2 +aP 3 is formed, computed and stored in latch 

42. The normalizer 46 divides this sum by a factor of 4 
and the resultant normalized value is coupled into the 
latch 47. Again, on cycle 0 (the start of new data into the 

40 prescaler 43 for the next pixel) the data is clocked from 
the latch thereby providing the filtered pixel data for the 
display. Once again, a four cycle scheme may be used 
with the fourth cycle (cycle 3) controlling latch 47. 

In Figure 7A a general prescaler is shown compris- 
es ing a multiplier 50. The input pixel data is coupled to the 
multiplier, the output of the multiplier provides the scaled 
pixel data. The amount of multiplication is controlled by 
the output of the coefficient lookup table 51 . This output 
is determined by the cycle number. The cycle number 
50 (e.g., 1,2, 3 . . . ) selects the amount of multiplication re- 
quired for the convolution being used and thereby con- 
trols the amount of multiplication performed by the mul- 
tiplier 50. 

Figure 7B illustrates a prescaler which may be used 
55 when the multiplication used by the convolution step in- 
volves multiplication by one or by two. In this case a mul- 
tiplexer 53 receives the input pixel data at one terminal 
and the input pixel data multiplied by two (i.e., left-shift- 
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ed by 1 with a zero filling in on the right) at its other ter- 
minal. The cycle number requiring the "x2" pixel data is 
used to select the "0" input to the multiplexer 53 and 
thus provides the needed scaled input pixel data. 

The convolvers discussed above are particularly 
good for a serial kernel data stream. Figure 6 illustrates 
a convolver implemented in a table 71 which can be 
used for a parallel data stream. It is particularly useful 
when a limited number of bits are used; for example, in 
a 1 bit/pixel display where the 1-2-1 kernel is used. The 
results of the convolution arithmetic are precomputed 
and placed in the table. This is used as will be seen for 
software embodiments of the invention where the color 
lookup table is preloaded for use as a convolution 
lookup table. 

SOFTWARE EMBODIMENT OF THE PRESENT 
INVENTION 

The method of the present invention can be readily 
implemented in software to provide real time convolu- 
tion. An embodiment of the invention is described below 
for a "chunky" frame buffer. 

With a chunky frame buffer, all the bits for a given 
pixel are stored as adjacent bits of a memory word. For 
example, if color depth is 4 bits per pixel, and the CPU 
word size is 32 bits, then 8 pixels are stored in each CPU 
word. Unlike the planar frame buffer, a given CPU ac- 
cess will always access all the bits in a given pixels, and 
in some cases, the bits in adjacent pixels. Chunky frame 
buffers are also used in commercially available comput- 
ers such as Apple Computer, Inc.'s Macintosh II com- 
puter. 

SOFTWARE EMBODIMENT FOR THE CHUNKY 
FRAME BUFFER 

In this embodiment, a one bit per pixel real-time 
convolution with a chunky frame buffer is realized. Un- 
like a method for a planar frame buffer, the exact number 
of bits per pixel cannot be obtained when rearranging 
the data, hence, the next power of 2 greater than the 
number of bits needed is used. For the described em- 
bodiment, three-lines are used for the convolution and 
hence, four bits of pixel data are stored in a buffer for 
each pixel. The method described below places the bits 
in their proper position. 

First, it should be noted that a one bit per pixel frame 
buffer "off screen" in RAM is used by the CPU for draw- 
ing. This frame buffer is separate from the four bit per 
pixel frame buffer that is actually scanned to provide the 
display. The method described below reads data from 
the one bit per pixel frame buffer, expands the data to 
the four bits per pixel, then writes the data into the four 
bit per pixel frame buffer. The method merges the pixel 
information from the two previous lines before it writes 
the results into the four bit per pixel frame buffer. When 
the four bit pixel is presented to the color lookup table, 



the three bits for lines n-1, n and n+1 are available to 
lookup the proper gray scale for the 1-2-1 convolution. 
Again, as with the previous embodiment, the color 
lookup table is loaded with gray scale information to pro- 
5 vide the convolution. (Three of the four bits read from 
the four bit per pixel frame buffer are used by the CLUT 
to provide the output convolved signal for the display.) 

Step 0 

10 

Four 32-bit words (A, B, C, and D) are initialized to 
zero. (A, B, C, and D each refer to 32-bit registers within 
the CPU.) A 32-bit word R is read starting from the left- 
most pixel position of the top scan line of the one bit per 
is pixel frame buffer. A, B, C and D are all stored at adja- 
cent left to right locations starting from the top scan line 
of the four bit per pixel frame buffer. 

Step 1 

20 

R is read from the next 32 bits in the one bit per pixel 
frame buffer immediately below the last 32-bit word read 
from the one bit per pixel frame buffer. This is shown in 
Figure 8 where two words, words 93 and 94, are shown 
25 for lines n and n+1 in the one bit per pixel frame buffer. 

Step 2 

As shown in Figure 9, one byte of R is expanded 
30 jnto a second 32-bit word M such that each of the 8 bits 
is placed at 4 bit intervals in the 32-bit word starting at 
bit 1 (i.e., bit 0 to bit 1 , bit 1 to bit 5, bit 2 to bit 9, etc.) 
and a 1 is placed in every 4th bit starting at bit 0. All 
other bits are set to zero. For example, the byte 0111 
35 0101 is converted to (shown as groups of 4): 0001 0011 
0011 0011 0001 0011 0001 0011. This is done by using 
a 256x32-bit pre-loaded lookup table in RAM. 

Step 3 

40 

A is left-shifted by 1 . In some microprocessors such 
as the Motorola Part No. 68020 this can be accom- 
plished more quickly by adding A to itself. In the upper 
part of Figure 10, A is shown before the shift and in the 
45 lower part of Figure 10 after the shift. 

Step 4 

M is bit-wise ORed into A as shown in Figure 11. 

50 First, this serves to merge the byte from R into A since 
it is known that the bits in A corresponding to the bits 
from the byte from R are all zero (anything ORed with 
zero retains its value). Second, this serves to force every 
4th bit starting with bit 0 in A to one (this sets up for the 

55 merge operation in step 10, below). 
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Step 5 

A is stored in the four bit per pixel frame buffer im- 
mediately below the last place A was stored as shown 
in Figure 12. 

Step 6 

Steps 2 through 4 are repeated for the three other 
byres from R. This time, however, B, C, and D are used 
instead of A. 

Step 7 

R is read for the next 32-bit word in the one bit per 
pixel frame buffer immediately below the last 32-bit word 
as in Step 1 above. 

Step 8 

As shown in Figure 9, one byte of R is expanded 
into M with each of the eight bits placed at 4 bit intervals 
starting at bit 1 . Also, a 0 is placed in every 4th bit start- 
ing at bit 0 and all other bits are set to 1 . For example, 
the byte 0111 0101 would be converted to 1100 1110 
1110 1110 1100 1110 1100 1110. This is accomplished 
by means of a second 256x32-bit pre-loaded lookup ta- 
ble in RAM. 

Step 9 

As shown in Figure 10, A is left-shifted by 1 . Once 
again, as mentioned for step 3, addition of A to itself may 
be used. 

Step 10 

As shown in Figure 11 , M is bit-wise ANDed into A. 
First, this serves to merge the byte from R into A since 
it is known that the bits in A corresponding to the bits 
from the byte from R are all ones (anything ANDed with 
one retains its value). Second, this serves to force every 
4th bit starting with bit 0 in A to zero (this will set up the 
merge operation in step 4, above). 

Step 11 

A is stored in the 4-bit frame buffer immediately be- 
low the last place A was stored. See word 95 of Figure 

12. 

Step 12 

Steps 8 through 10 are repeated for the 3 other 
bytes from R. They are merged in B, C, and D instead 
of A. See words 96, 97 and 98 of Figure 12. 



Step 13 

Steps 1 through 12 are repeated until the bottom of 
the frame buffer is reached, then R is read for the pixels 

s on the top scan line of the 1 bit/pixel frame buffer just to 
the right of where it was loaded at the start of the last 
pass. A, B, C, and D are all stored at adjacent I eft -to- 
right locations on the top scan line of the 4 bit/pixel frame 
buffer just -to the right of where they were loaded at the 

io start of the last pass. 

In summary, the pixels in the 4-bit per pixel frame 
buffer 100 of Figure 12 are coded with line n+1 in bit 1, 
n in bit 2, and n-1 in bit 3 (this resulting bit configuration 
is shown in Figure 1 1 ). Bit 0 is ignored by the CLUT 101 

is of Figure 1 2. The one bit per pixel frame buffer of Figure 
8 is scanned vertically with a new bit added into each 
four bit pixel for each scan line by left shifting the existing 
bit for the pixel by one and merging the new bit into bit 
1 of the 4-bit per pixel word. The shift operation serves 

20 to adjust the pixel from its previous centering on line n- 
1 (the line above) to its current centering on line n. In 
other words, when the operation begins the four bit pixel 
data contains bits from lines n-2, n-1 and n since the 
data was used for the line above. The left shift operation 

25 changes the configuration of the four bits to n-1 , n, and 
a one or a zero in bit 1 (bit 0 is ignored). Then, the new 
bit from the one bit per pixel frame buffer is merged into 
bit one for line n+1 . The new assembled four bit word is 
stored in the four bit per pixel frame buffer and as men- 

30 tioned, the CLUT is used to provide the convolution. 

In detail, the method starts in the upper-left of the 
frame buffer and works down a 32-pixel column. The 
read into R loads the 32 1 bit pixels then each 8 pixels 
of the 32 are operated upon separately. The first 8 pixels 

35 (a byte) are used as a lookup table index to fetch a 32 -bit 
word, M. M holds the 8 pixels, spread out at 4-bit inter- 
vals so that they are ready to merge for the 4 bit/pixel 
frame buffer. 

M also is set up with the rest of its bits prepared for 

40 either a bit-wise AND merge or an OR merge. The rea- 
son it alternates between AND and OR is that it saves 
the step of clearing (or setting) the bits in A which are 
the destination for the 8 pixels from R. Since A will be 
left-shifted just prior to the AND or OR merge, the bit 

45 immediately to the right of the destination of the R bits 
is forced so that at the next step they are already pre- 
pared for merging. AND prepares for the OR by forcing 
zeroes, and OR prepares for the AND by forcing ones. 
A is left-shifted by one to update the pixel from being 

50 centered for the previous line to being centered for the 
current line. The left-shift moves the previous line n+1 
to the current line n and the previous line n to the current 
line n-1 . Previous line n-1 (current line n-2) is shifted out. 
Notice that this shift applies to all eight pixels contained 

55 in the 32 bits of A so it is an 8-way parallel operation. 
Notice also the bits from previous line n-1 shifts into the 
unused bit of the next 4-bit pixel to the left (or off the left 
edge of the 32-bit word). 
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Then, M is merged with A by either an AND or an 
OR. Bits from n and n-1 are left alone, new n+1 bits are 
merged in, and the unused bits are set to known state 
(0 if an AND, 1 if an OR). A is finally stored in the 4 bit/ 
pixel Irame buffer. 

The other 24 pixels in R are handled the same way, 
with 8 pixels each for B, C, and D. 

The same steps are performed for each successive 
scan line below until the bottom of the frame buffer is 
reached. Then, the next column of 32 pixels immediately 
to the right is scanned-down, and so on until the entire 
frame is scanned. 

The CLUT 101 of Figure 12 is loaded in a similar 
manner to that of a planar frame buffer implementation 
shown in Figure 13. The differences are that the bit or- 
dering is different and that since bit 0 in the 4-bit pixels 
is indeterminate (it alternates depending on whether the 
last merge was with an AND or an OR), the same gray 
value for every two CLUT entries is stored. 

OTHER CONVOLUTION KERNELS 

In the previous section, most of the emphasis has 
been on the 1 -2-1 kernel. Experiments have shown that 
neither a 3-line convolution nor on-off-on-off reduction 
of 50% gray is essential in all situations for effective in- 
terlace flicker reduction. It the constraint that on-off-on- 
off horizontal line patterns are reduced to a 50% gray is 
maintained and other kernel sizes are tried other than 1 
x 3, for each kernel size there is one set of coefficients 
to meet the on-off-on-off constraint. These coefficients 
match Pasqual's triangle (i.e., 1 ; 1, 1; 1, 2, 1; 1, 3, 3, 1; 
1, 4, 6, 4, 1; etc.). 

ADAPTIVE CONVOLUTION 

As mentioned above, the worst flicker patterns are 
the ones which have high horizontal coherence (i.e., re- 
peat horizontally). Horizontal solid lines, horizontal 
dashed lines, and gray dither patterns are examples of 
patterns with high horizontal coherence. Text is an ex- 
ample of patterns without such coherence. The convo- 
lution discussed above may be adaptive, that is, it may 
be varied depending on the type of patterns being dis- 
played. First, it is determined whether a repeating pat- 
tern is occurring in a local horizontal group ot kernels, 
for example, 8 pixels across. If there is a pattern in the 
kernels, then all of the kernels are convolved for exam- 
ple, with the 1-2-1 coefficients. If there is no such pat- 
tern, then the 8 pixels are convolved with coefficients 
making a sharper filter (e.g., 1-3-1 or 1-4-1). The test to 
determine whether a pattern is repeating must be ap- 
plied continuously in a moving horizontal window, kernel 
by kernel. Since the test windows overlap, some kernels 
may be part of a pattern in one test window but not in 
another. For these kernels, the 1-2-1 convolution is 
used, since they are at the edge of the patern. Different 
tests may be used for determining whether a pattern is 



being repeated, for example, the left four kernels may 
be compared with the right four kernels within the win- 
dow. 

5 

Claims 

1. A method for providing filtered pixel data for a dis- 
play stored in a raster-scanned video graphics ap- 
paratus having a first memory where pixel data for 
each scan line is stored in adjacent locations in said 
first memory such that each word accessed from 
said first memory includes the data for at least one 
pixel, and a second memory storing first coded pixel 
data of n-1 , n, and n+1 scan lines in adjacent loca- 
tions, where n represents a given scan line in the 
first memory, said method comprising the steps of: 

reading pixel data from scan line n+2 from the 
first memory; 

shifting the first coded pixel data of the second 
memory; 

merging the pixel data from scan line n+2 with 
the shifted first coded pixel data to generate 
second coded pixel data in the second memory 
for n, n+1, and n+2 scan lines; and 
convolving said second coded pixel data to 
generate said filtered pixel data. 

30 2. The method defined by Claim 1 wherein said con- 
volving step comprises the step of performing the 
following computer implemented computation: 

35 aP^+bPg+aPg 

2a+b, 

where is the pixel data for the nth scan line, P 2 
is the pixel data for scan line n + 1 , and P 3 is the 
40 pixel data for scan line n + 2, and wherein a and b 
are constants. 

3. The method as defined by Claim 1 wherein the step 
of convolving comprises the further step of gener- 

45 ating an RGB output signal based on the pixel data. 

4. The method as defined by Claim 3 further compris- 
ing the step of applying the RGB output signal to a 
color monitor to display a representation of the pixel 

50 data. 



PatentansprOche 

55 1 . Verfahren zum Bereitstellen gefilterter Bildpunktda- 
ten fur einen Bildschirm, die in einer Rasterabtast- 
Videographik-Vorrichtung abgelegt sind, die einen 
ersten Speicher hat, wobei Bildpunktdaten fur jede 
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Bildzeile in benachbarten Speicherplatzen in dem 
ersten Speicher abgelegt werden, so daG von dem 
ersten Speicher jedes Wort, auf das zugegriffen 
wird, die Daten f Or mindestens einen Bildpunkt ent- 
halt, und einen zweiten Speicher, der erste kodierte s 
Bildpunktdaten von Bildzeilen n-1, n und n+1 in be- 
nachbarten Speicherplatzen ablegt, wobei n fur ei- 
ne bestimmte Bildzeile in dem ersten Speicher 
stent, wobei das Verfahren die Schritte beinhaltet: 

10 

Lesen von Bildpunktdaten aus Bildzeile n+2 
aus dem ersten Speicher; 
Verschieben der ersten kodierten Bildpunktda- 
ten des zweiten Speichers; 

Vereinigen der Bildpunktdaten aus Bildzeile is 
n+2 mit den verschobenen ersten kodierten 
Bildpunktdaten, um zweite kodierte Bildpunkt- 
daten in dem zweiten Speicher fur Bildzeilen n, 
n+1 und n+2 zu erzeugen; und 
Verknupfen der zweiten kodierten Bildpunktda- 20 
ten, um die gefilterten Bildpunktdaten zu erzeu- 
gen. 

2. Verfahren nach Anspruch 1, bei dem der Verknup- 
fungsschritt den Schritt umfaGt, die folgende com- 25 
puter-implementierte Rechnung durchzufuhren: 

aP 1 +bP 2 +aP 3 

2a+b, 30 

wobei P<, das Bildpunktdatum der n-ten Bildzeile ist, 
P 2 das Bildpunktdatum der Bildzeile n+1 ist, und P 3 
ist das Bildpunktdatum der Bildzeile n+2, und wobei 
a und b Konstanten sind. 35 

3. Verfahren nach Anspruch 1, bei dem der VerknOp- 
fungsschritt ferner den Schritt umfaGt, daG ein auf 
den Bildpunktdaten basierendes RGB-Ausgangssi- 
gnal generiert wird. 40 

4. Verfahren nach Anspruch 3, ferner den Schritt be- 
inhaltend, daG das RGB-Ausgangssignal einem 
Farbbildschirm zugefuhrt wird, um eine Darstellung 
der Bildpunktdaten zu zeigen. <s 



pixel, et une deuxieme mSmoire qui m6morise des 
premieres donn6es cod6es de pixels de lignes de 
balayage n - 1 , n , n + 1 dans des emplacements 
adjacents, ou n repr^sente une ligne de balayage 
donn6e de la premiere memoire, ledit proc6d6 com- 
prenant les Stapes consistant a: 

lire, dans la premiere mSmoire, des donn6es 
de pixels provenant de la ligne n + 2; 
d6caler les premieres donn6es cod6es de 
pixels de la deuxieme m6moire; 
fusionner les donn6es de pixels provenant de 
la ligne de balayage n+2 avec les premieres 
donnSes cod6es d6ca!6es de pixels pour en- 
gendrer des deuxiemes donnSes cod6es de 
pixels dans la deuxieme m6moire pour des li- 
gnes de balayage n, n + 1 , n + 2; et 
effectuer une convolution desdites deuxiemes 
donnSes cod6es pour engendrer lesdites don- 
n6es f iltr6es de pixels. 

2. Le proc6d6 selon la revendication 1 , dans lequel la- 
dite 6tape de convolution comprend PStape consis- 
tant a effectuer le calcul suivant: 

a P 1 + b P 2 + a P 3 
2a7b 

oD Pt est la donn6e de pixel pour la n-ieme 
ligne de balayage, P 2 la donn6e de pixel pour la li- 
gne de balayage n + 1 , et P 3 la donn6e de pixel pour 
la ligne de balayage n + 2; et ou a et b sont des 
constantes. 

3. Le proc6d6 selon la revendication 1 dans lequel la- 
dite 6tape de convolution comprend l'6tape addi- 
tionnelle consistant a engendrer un signal de sortie 
RGB sur la base des donn6es de pixels. 

4. Le procSde" selon la revendication 3 qui comprend 
en outre l'6tape consistant a appliquer le signal de 
sortie RGB a un monitor en couleurs pour afficher 
une repr6sentation des donndes de pixels. 



Revendlcations 

1. Un proc6d6 de g§n6ration de donnSes filtr6es de so 
pixels pour une affichage m6moris6 dans un appa- 
reil graphique vid6o a balayage de trame qui inclut 
une premiere m6moire, des donn^es de pixels pour 
chaque ligne de balayage 6tant m6moris6es dans 
des emplacements adjacents de ladite premiere ss 
m6moire d'une maniere telle que chaque mot 
auquel un acces est effectu6 a partir de ladite pre- 
miere m6moire inclut les donnSes pour au moins un 
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